Clustering of XML documents
Identifieur interne : 001964 ( Main/Exploration ); précédent : 001963; suivant : 001965Clustering of XML documents
Auteurs : Damien Guillaume [États-Unis] ; Fionn Murtagh [Royaume-Uni]Source :
- Computer Physics Communications [ 0010-4655 ] ; 2000.
English descriptors
- Teeft :
- Algorithm, Astronomical markup language, Astronomical object, Astronomical object names, Astronomical objects, Average number, Balanced clusters, Best keyword, Best score, Cluster, Collaboration graph, Cost function, Current_solution, Damien guillaume, Database, Document, Elsevier science, Euclidean distance, Extensible markup language, Goto step, Graph partitioning, Graph partitioning algorithm, Graph partitioning problem, Grid, Guillaume, Html, Html documents, Ident, Indexing vocabulary, Information retrieval, Initial solution, Java applet, Jean heyvaerts, Keyword, Keyword links, Keywords, Knowledge discovery, Links, Local minima problem, Many articles, Many documents, Markup, Markup language, Murtagh, Murtagh computer physics communications, Node, Noising, Noising algorithm, Noising method, Noising rate, Original data, Other clusters, Other documents, Partitioning, Partitioning algorithm, Regular grid, Same cluster, Same keyword, Same number, Same time, Server, Similar documents, Total number, User, User clicks, User interface, Xlink.
Abstract
Abstract: Self-organization or clustering of data objects can be a powerful aid towards knowledge discovery in distributed databases. The web presents opportunities for such clustering of documents and other data objects. This potential will be even more pronounced when XML becomes widely used over the next few years. Based on clustering of XML links, we explore a visualization approach for discovering knowledge on the web.
Url:
DOI: 10.1016/S0010-4655(99)00511-1
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 003671
- to stream Istex, to step Curation: 002A80
- to stream Istex, to step Checkpoint: 001793
- to stream Main, to step Merge: 001A08
- to stream Main, to step Curation: 001964
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title>Clustering of XML documents</title>
<author><name sortKey="Guillaume, Damien" sort="Guillaume, Damien" uniqKey="Guillaume D" first="Damien" last="Guillaume">Damien Guillaume</name>
</author>
<author><name sortKey="Murtagh, Fionn" sort="Murtagh, Fionn" uniqKey="Murtagh F" first="Fionn" last="Murtagh">Fionn Murtagh</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:D3AA01AB63527FA108046A9B121443AEFF4BA385</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1016/S0010-4655(99)00511-1</idno>
<idno type="url">https://api.istex.fr/ark:/67375/6H6-ZRKLL3V8-9/fulltext.pdf</idno>
<idno type="wicri:Area/Istex/Corpus">003671</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Corpus" wicri:corpus="ISTEX">003671</idno>
<idno type="wicri:Area/Istex/Curation">002A80</idno>
<idno type="wicri:Area/Istex/Checkpoint">001793</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">001793</idno>
<idno type="wicri:doubleKey">0010-4655:2000:Guillaume D:clustering:of:xml</idno>
<idno type="wicri:Area/Main/Merge">001A08</idno>
<idno type="wicri:Area/Main/Curation">001964</idno>
<idno type="wicri:Area/Main/Exploration">001964</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a">Clustering of XML documents</title>
<author><name sortKey="Guillaume, Damien" sort="Guillaume, Damien" uniqKey="Guillaume D" first="Damien" last="Guillaume">Damien Guillaume</name>
<affiliation wicri:level="2"><country xml:lang="fr">États-Unis</country>
<wicri:regionArea>National Center for Supercomputing Applications, Astronomy Department, University of Illinois at Urbana-Champaign, 1002 West Green Street, Urbana, IL 61801</wicri:regionArea>
<placeName><region type="state">Illinois</region>
</placeName>
</affiliation>
</author>
<author><name sortKey="Murtagh, Fionn" sort="Murtagh, Fionn" uniqKey="Murtagh F" first="Fionn" last="Murtagh">Fionn Murtagh</name>
<affiliation wicri:level="1"><country wicri:rule="url">Royaume-Uni</country>
</affiliation>
<affiliation></affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="j">Computer Physics Communications</title>
<title level="j" type="abbrev">COMPHY</title>
<idno type="ISSN">0010-4655</idno>
<imprint><publisher>ELSEVIER</publisher>
<date type="published" when="2000">2000</date>
<biblScope unit="volume">127</biblScope>
<biblScope unit="issue">2–3</biblScope>
<biblScope unit="page" from="215">215</biblScope>
<biblScope unit="page" to="227">227</biblScope>
</imprint>
<idno type="ISSN">0010-4655</idno>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0010-4655</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass><keywords scheme="Teeft" xml:lang="en"><term>Algorithm</term>
<term>Astronomical markup language</term>
<term>Astronomical object</term>
<term>Astronomical object names</term>
<term>Astronomical objects</term>
<term>Average number</term>
<term>Balanced clusters</term>
<term>Best keyword</term>
<term>Best score</term>
<term>Cluster</term>
<term>Collaboration graph</term>
<term>Cost function</term>
<term>Current_solution</term>
<term>Damien guillaume</term>
<term>Database</term>
<term>Document</term>
<term>Elsevier science</term>
<term>Euclidean distance</term>
<term>Extensible markup language</term>
<term>Goto step</term>
<term>Graph partitioning</term>
<term>Graph partitioning algorithm</term>
<term>Graph partitioning problem</term>
<term>Grid</term>
<term>Guillaume</term>
<term>Html</term>
<term>Html documents</term>
<term>Ident</term>
<term>Indexing vocabulary</term>
<term>Information retrieval</term>
<term>Initial solution</term>
<term>Java applet</term>
<term>Jean heyvaerts</term>
<term>Keyword</term>
<term>Keyword links</term>
<term>Keywords</term>
<term>Knowledge discovery</term>
<term>Links</term>
<term>Local minima problem</term>
<term>Many articles</term>
<term>Many documents</term>
<term>Markup</term>
<term>Markup language</term>
<term>Murtagh</term>
<term>Murtagh computer physics communications</term>
<term>Node</term>
<term>Noising</term>
<term>Noising algorithm</term>
<term>Noising method</term>
<term>Noising rate</term>
<term>Original data</term>
<term>Other clusters</term>
<term>Other documents</term>
<term>Partitioning</term>
<term>Partitioning algorithm</term>
<term>Regular grid</term>
<term>Same cluster</term>
<term>Same keyword</term>
<term>Same number</term>
<term>Same time</term>
<term>Server</term>
<term>Similar documents</term>
<term>Total number</term>
<term>User</term>
<term>User clicks</term>
<term>User interface</term>
<term>Xlink</term>
</keywords>
</textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Self-organization or clustering of data objects can be a powerful aid towards knowledge discovery in distributed databases. The web presents opportunities for such clustering of documents and other data objects. This potential will be even more pronounced when XML becomes widely used over the next few years. Based on clustering of XML links, we explore a visualization approach for discovering knowledge on the web.</div>
</front>
</TEI>
<affiliations><list><country><li>Royaume-Uni</li>
<li>États-Unis</li>
</country>
<region><li>Illinois</li>
</region>
</list>
<tree><country name="États-Unis"><region name="Illinois"><name sortKey="Guillaume, Damien" sort="Guillaume, Damien" uniqKey="Guillaume D" first="Damien" last="Guillaume">Damien Guillaume</name>
</region>
</country>
<country name="Royaume-Uni"><noRegion><name sortKey="Murtagh, Fionn" sort="Murtagh, Fionn" uniqKey="Murtagh F" first="Fionn" last="Murtagh">Fionn Murtagh</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Informatique/explor/SgmlV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001964 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001964 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Informatique |area= SgmlV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:D3AA01AB63527FA108046A9B121443AEFF4BA385 |texte= Clustering of XML documents }}
This area was generated with Dilib version V0.6.33. |